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2. Category and General Description. Is the invention a new process, composition of matter, a device, 
or one or more products? A new use for, or an improvement to, an existing product or process? 

The invention can be characterized as a new process. We have made a number of important observations 
that collectively defme novel methods in the identification of compounds that antagonize the ftmction of 
HAUSP. Because HAUSP is an MDM2-specific deubiquitinating enzyme and MDM2 contributes to 
several forms of cancer, inhibiting HAUSP may lead to the degradation of MDM2 and subsequent 
restoration of p53 function. 

3. Utility. What are possible uses for the invention? In addition to immediate applications, are there any 
other uses that might be realized in the future? 

This invention can be directly used for the identification of novel compounds that inhibit the function of 
HAUSP. 

4. Novelty. Pick out and expand on novel and unusual features. How does the invention differ from 
present technologies currently being used in industry? What problems does it solve, or what advantage 
does it possess? 

p53 is one of the most important tumor suppressor proteins and is inactivated in nearly all cases of 
cancer, hi approximately half of all cancer cases, p53 is inactivated by mutation; in the other half of all 
• cancer cases, wild-type p53 is inactivated by accelerated degradation or mis-localization. MDM2 is an 
oncoprotein that actively ubiquitinates p53, leading to its degradation. MDM2 also ubiquitinates itself, 
thus resulting in its own degradation. HAUSP is a member of the UBP family of deubiquitinating 



enzymes and selectively targets MDM2 and p53 for deubiquitination, hence rescuing both proteins from 
degradation. The specific deubiquitination relies on the specific binding of MDM2 and p53 by HAUSP. 

We discovered that HAUSP exhibits much higher binding affinity for MDM2 than for p53, 
implying that MDM2 is the bona fide substrate for HAUSP in cells. More importantly, we discovered that 
both MDM2 and p53 bind to the same surface pxjcket in the N- terminal TRAF-like domain of HAUSP. 
The binding element from MDM2 or p53 is a short peptide composed of no more than 12 amino acids. In 
addition, we determined a number of crystal structures on HAUSP, including the catalytic core domain, 
the catalytic core domain bound to ubiquitin aldehyde, the N-terminal TRAF-like domain, and the 
combined TRAF-like and catalytic core domain. These structiires reveal important insights on inhibitor 
screening. In particular, the TRAF-like domain contains a surface groove that is responsible for binding to 
the MDM2 or p53 peptide. The catalytic core domain contains a papain-like active site. The C-terminus of 
ubiquitin is specifically coordinated by a surface clefl in the catalytic core domain. 

5. Method of Synthesis, Assembly, or Process. If the invention is a composition of matter, a device, or a 
product, how is it made? If the invention is a process, what are the steps involved? 

(1) Screening for peptide-Iike small molecules that can bind to the TRAF-like domain of HAUSP; 

(2) Screening for inhibitors that bind to the active site of HAUSP or inhibitors that bind to the surface 
cleft of HAUSP (this surface cleft specifically binds to the C-terminus of ubiquitin); 

(3) Test the ability of these compounds to inhibit the deubiquitinating activity of HAUSP using 
polyubiquitinated MDM2 as a substrate; 

(4) Validate lead compounds in cancer cell lines and animal models; 

(5) Improve the efficacy of such compounds and launch clinical trials. 

6. Limitations. Does this invention possess disadvantages or limitations? Can they be overcome? How? 
There is no apparent limitations at present time. 

7. Experimental Verificatioo. Have you tested the invention experimentally? YES NO If yes, 

describe below. Have you constructed a prototype, model, or test samples which are available for 
examination? YES NO If yes, describe below. 

8. Publications. 

A) Have you described the invention in a publication or an oral presentation? What was the date of 
publiciation or presentation? Was it described in specific or general fashion? Include abstracts of talks, 
news stories, etc. Be sure to include copies of any publications, label them collectively as attachment A. 

A manuscript describing this invention will be submitted for publication in the near future. I have not 
made an oral presentation to the public. 

B) Any plarmed future publications or public disclosure? YES NO X If yes please include a date 

when the publication will go into the U.S. mail for distribution and the date any public disclosure such as 
a talk will take place. If there are attachments label as attachment B. 

9. References. Are there inventions or publications by others that are related to this invention? Please list 
on a separate sheet, and attach copies if available. Label this list and attach collectively as attachment C. 

Maybe. But this invention is very unique and should be distinct firom all other inventions. 



10. Sponsorship. Was the work that led to the invention sponsored? List sponsors below and attach 
copies of contracts or grant agreements if available. Label these collectively as attachment D. 

Sponsor Contract or Grant Check if copy 

Number provided 

a) Nffl 126-6087 



11. MTAs. Was the invention made with any material or biological substance obtained through a Material 
Transfer Agreement? Please provide the name of the provider, their affiliation and a copy of the 
Agreement. 

No. 

12. Abstract Please provide a brief (less than one page) abstract of the invention. NOTE, THIS 
ABSTRACT IS INTENDED FOR MARKETING PURPOSES UNDER NONCONFIDENTL^L 
SITUATIONS, While it is important to include the novel nature of the invention, a general description, 
intended or proposed uses and utility, the abstract must be a NON-ENABLING DISCLOSURE. Please 
DO NOT include the essential elements which would allow someone to practice the invention or 
reproduce the material without a license. 

Cancer is one of the most debilitating diseases affecting mankind. More than 600 thousand 
patients die of cancer in the United States. Oncogenesis is inhibited by the tumor suppressor genes and 
accelerated by oncogenes. p53 is one of the most important tumor suppressor proteins and is inactivated in 
nearly all cases of cancer. In approximately half of all cancer cases, p53 is inactivated by mutation; in the 
other half of all cancer cases» wild-type p53 is inactivated by accelerated degradation or mis-localization, 
thus unable to perform its function. MDM2 is an oncoprotein that actively ubiquitinates p53, leading to its 
degradation. MDM2 also ubiquitinates itself, thus resulting in its own degradation. Restoration of p53 
function in cancer patients constitutes an important and effective therapeutic method. 

HAUSP is a member of the UBP family of deubiquitinating enzymes and selectively targets 
MDM2 and p53 for deubiquitination, hence rescuing both proteins fh>m degradation. The specific 
deubiquitination relies on the specific binding of MDM2 and p53 by HAUSP. 

We discovered that HAUSP exhibits a much higher binding affinity for MDM2 than for p53, 
implying that MDM2 is a bona fide substrate for HAUSP in cells. More importantly, we discovered that 
both MDM2 and p53 bind to the same surface pocket in the N-terminal TRAF-like domain of HAUSP. 
The binding element fi-om MDM2 or p53 is a short peptide composed of no more than 12 amino acids. In 
addition, we determined a number of crystal structures on HAUSP, including the catalytic core domain, 
the catalytic core domain bound to ubiquitin aldehyde, the N-terminal TRAF-like domain, and the 
combined TRAF-like and catalytic core domain. These structures reveal important insights on inhibitor 
screening. In particular, the TRAF-like domain contains a surface groove that is responsible for binding to 
the MDM2 or p53 peptide. The catalytic core domain contains a papain-like active site. The C-temiinus of 
ubiquitin is specifically coordinated by a surface cleft in the catalytic core domain. 



Our novel findings immediately identify methods for the screening of anti-HAUSP compounds. 
The first method involves screening for compounds that bind to the surface groove of the TRAF-like 
domain. These compounds will prevent the specific recognition of MDM2 by HAUSP, hence accelerating 
the degradation of MDM2. The second method involves screening for compounds that bind to the surface 
clefl of the catalytic core domain, which normally coordinates the C-terminus of ubiquitin. These 
compounds will interfere with the deubiquitinating activity of HAUSP. The third method involves 
screening for compoimds that directly bind to the active site of HAUSP, hence suppressing its catalytic 
activity. 

13. Possible Means of Commercialization. How do you envision that the invention might be used in a 
commercial product or a process for producing a product? Feel free to include as many possibilities as you 
can. What advantages does this invention have over existing technologies? 

Our biochemical and structural information will most likely be explored by pharmaceutical companies for 
the rational design or screening of drugs. These drugs may be used to treat cancer. 

14. Potential Licensees. Have you described the invention to industry representatives? Did you describe it 
in a specific or a general fashion? Did they express any interest? Name them and specific individuals and 
their titles. Do you know of any other firms that might be particularly interested? 



Not yet. 
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HAUSP recognizes multiple substrates through its N 
terminal domain 



Introduction 

With hundreds of members, the UBP family of deubiquitinating enzymes has 
become one of the largest enzyme families known to date. For this large family of 
enzymes, substrate specificity has always been an important yet unresolved 
issue. The fact that most UBPs contain divergent sequences at their N and/or C 
terminus strongly indicates that UBPs may have their own specific substrates 
and the divergent N- or C-terminal sequences are likely to be involved in the 
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recognition of those substrates. Studies have also shown that, in higher 
eukaryotes, UBPs are involved in the regulation of various biological processes, 
including eye development, cell growth, oncogenic transformation and cell cycle 
regulation. Despite strong evidence suggesting that UBPs may have specialized 
regulatory functions in multicellular organisms, only very few UBPs with specific 
substrates have been identified, among which HAUSP is the first mammalian 
example. 

In 1997, HAUSP was first identified as a cellular interacting protein with 
the herpes simplex virus type I immediate early gene product. Vmw110 (also 
called ICPO), and was then named HAUSP, which stands for herpes simplex 
virus associated ubiquitin specific protease. Vmw110 specifically binds to 
HAUSP C-terminal domain. In addition to VmwIlO, another viral protein EBNA1 
(Epstein-Barr virus nuclear antigen 1) was also identified to interact with HAUSP. 
Further studies show that the N-terminal domain of HAUSP is responsible for 
EBNA1 binding and an EBNA1 peptide of residues 395-450 was sufficient for 
binding to HAUSP. As will be mentioned later, the HAUSP N-terminal domain, 
which is responsible for EBNA1 binding, is also responsible for p53 recognition. 
The fact that EBNA1 binds to HAUSP N-terminal domain and efficiently 
competes off p53 peptide suggests that EBNA1 might also affect p53 function in 
vivo by competing for HAUSP. 

What makes HAUSP an even more important regulatory protein in cells is 
the finding that HAUSP can deubiquitinate both p53 and Mdm2, and thus may 
play a very important regulatory role in the p53-Mdm2 pathway. 
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The p53 tumor suppressor protein is a sequence-specific transcription 
factor that can respond to a wide variety of cellular stress signals. The proper 
function of p53 is indispensable to many cellular processes such as DNA repair, 
cell cycle arrest, and apoptosis. Loss of p53 function has been observed in 
almost all tumor types and the gene is mutated in more than 50% of tumors, 
making it the most commonly mutated gene in human cancer. Mdm2 was later 
identified to be the major mediator for p53 ubiquitination in normal ceils. Mdm2 
was first identified as the product of a gene amplified in a spontaneously 
transformed murine cell line, and was found to be able to inhibit the 
transcriptional activation activity of p53 by binding to its transactivation domain. 
Later, it was also found that Mdm2 can promote the ubiquitination and 
degradation of p53 both in vivo and in vitro. Further studies demonstrated that 
Mdm2 is a RING finger E3 ubiquitin ligase, and its C-temriinal RING finger motif 
can not only promote p53 ubiquitination both in vivo and in vitro, but also target 
Mdm2 for autoubiquitination and subsequent degradation. One interesting 
feature of the Mdm2-p53 regulatory pair is the existence of an autoregulatory 
feedback loop between Mdm2 and p53. In addition to mediating p53 
ubiquitination, Mdm2 is also a negative regulator of p53's transcriptional activity, 
but on the other hand, the Mdm2 gene itself is one of the transcriptional targets 
of p53, so the activation of Mclm2 gene by p53 would automatically lead to the 
repression of p53 function. 

The Mdm2-p53 pathway is one of the most important pathways involved in 
p53 regulation. Previous studies imply that multiple mechanisms are involved in 
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the stabilization of p53 througli regulation of the Mdm2-p53 pathway. For 
example, both phosphorylation and acetylation events can promote p53 
stabilization. Furthermore, in response to oncogenic activation, the pi tumor 
suppressor can also stabilize p53 through the inhibition of Mdm2 ubiquitin iigase 
activity. Besides, the profound effect of ubiquitination on the p53-Mdm2 pathway 
clearly provides an opportunity for deubiquitination to play a role. Indeed, in year 
2002, HAUSP was found to be a novel p53-interacting protein. HAUSP can 
directly deubiquitinate p53 both in vivo and in vitro, and this deubiquitination 
function strongly stabilizes p53 and promotes p53<lependent apoptosis. 

More recently, the picture of HAUSP's function in the p53-MDM2 pathway 
became more complex with the new discovery that HAUSP exhibits a robust 
deubiquitinating activity toward Mdm2. Just like p53. MDM2 can also be 
efficiently deubiquitinated both in vitro and in vivo by HAUSP, and reduction of 
HAUSP by transient RNAi leads to a decreased half-life in Mdm2. This 
observation suggests that inactivating HAUSP may have important application in 
anti-cancer therapies, as it will down-regulate Mdm2. Nonetheless, the exact 
regulatory mechanisms of HAUSP in the p53-Mdm2 pathway remain 
incompletely defined, and further structural and/or biochemical characterization 
of the recognition between HAUSP. p53 and Mdm2 will certainly provide more 
insights into the real function of HAUSP, and help to elucidate the complex 
regulatory mechanism in the whole inter-dependent network. 
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Results 

HAUSP recognizes the C terminus of p53 

p53 contains an N-terminal transactivation donnain (residues15-30), a 
central DNA-binding core domain (residues 94-292), a tetrannerization donnain 
(residues 326-355), and a C-tenminal regulatory domain (residues 356-393). It 
has been reported that the N-terminal domain (residues 1-248) of HAUSP was 
sufficient for binding to p53- To further define the domain boundary, we 
generated a series of deletion variants of HAUSP, expressed and purified these 
recombinant proteins. We examined their ability to interact with p53 by gel 
filtration. Based on gel-filtration results, a protease-resistant fragment (residues 
53-208) of HAUSP is both necessary and sufficient for stable interactions with 
p53, while neither the core domain (residues 208-560) nor the C-terminal domain 
(residues 560-1 102) formed a stable complex with p53 (Figure 1). 

To identify the minimal sequence requirement in p53, we generated a 
number of deletion variants and assayed their interaction with HAUSP (residues 
53-208) using gel filtration (Figure 1A). Neither the DNA-binding core domain nor 
the oligomerization domain of p53 is required for the formation of a stable 
complex with HAUSP. In contrast, a C-terminal 32-residue peptide of p53 
(residues 351-382) is sufficient for the interaction with HAUSP (Figures IB & C), 
In particular, a short 11 -residue peptide stretch (residues 357-367) of p53 plays a 
critical role in binding to HAUSP as removal of this sequence from p53 (325-356) 
crippled its interactions with HAUSP (Figure IB). 
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Figure 1 The N-termtnal doniain of HAUSP recognizes the C-terminal peptide sequences of 
p53. 

(A) Identification of the N-termlnal domain of HAUSP as the primary p53 binding motif. 
Various purified recombinant HAUSP fragments were individually incubated with a large p53 
fragment (residue 94-393) and then subjected to gel-filtration analysis. The results are 
summarized. 

(B) A minimai p53 fragment is both necessary and sufficient for HAUSP binding. p53 
peptides with various length and boundaries were purified individually and were tested for binding 
with the HAUSP N-terminal domain (residues 53-208) by ge)-filtration. The results are 
summarized. 

(C) A representative gel-filtration run for the complex between HAUSP (residues 53-208) 
and p53 (residues 351-382). Aliquots of the fractions were visualized by Coomassie staining 
following SDS-PAGE. The p63 (residues 351-382) peptide, which in isolation was eluted from gel- 
filtration in fractions 38-39, was eluted in fraction 31-33 when complexed with HAUSP (residues 
53-208). 



These analyses demonstrate that the N-terminal domain (53-208) of 
HAUSP stably interacts with a minimal C-temninai peptide (357-382) of p53 
(Figure 1B). Supporting this conclusion, the N-terminal domain of HAUSP 
(residues 58-196) was found to share significant homology (up to 32% sequence 
identity) to the TRAP (TNF receptor associated factor) domain, a known peptide- 
blnding motif. 

HAUSP binds to a short peptide motif of EBNA1 

Full-length EBNA1 protein contains 641 amino acids. The identified 
functional domains of EBNA1 include a long N-terminal Gly-Ala repeat (GAr), a 
nuclear localization signal, a dimerization domain, and a DNA-binding domain. 
The GAr in EBNA1 spans a sequence of 238 amino acids (residue 91-328). It 
helps to block antigen presentation through inhibition of the ubiquitin-dependent 
proteolysis of EBNA1. A 56 amino-acid region (residues 395-450) in EBNA1 has 
been shown to be both necessary and sufficient to bind to HAUSP. To further 
define the HAUSP-interacting sequence in EBNA1, we first generated two 
deletion variants for the previously identified HAUSP-interacting region (residues 
395-450), with one retaining the N-terminal half sequence and the other retaining 
the C-terminal half sequence. Both deletion variants were purified to 
homogeneity in their GST-tagged form, and were reloaded to GS4B columns to 
generate EBNA1 affinity columns. The purified HAUSP N-terminal domain 
protein (residue 53-206), which was previously shown to interact with EBNA1, 
was loaded onto the EBNA1 affinity columns and was subjected to extensive 
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washing afterwards. The binding of EBNA1 deletion variants to HAUSP were 
assayed through their ability to retain HAUSP N-terminal domain protein on the 
affinity column. The EBNA1 affinity chromatography results indicate that the N- 
terminal half of EBNA1 (395-450) doesn't bind to HAUSP N-terminal domain, 
while the C-terminal half sequence (residues 422-450) is both necessary and 
sufficient to bind to HAUSP N-terminal domain (Figure 2). 
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Figure 2 HAUSP N-termlnal domain binds to EBNA1 (422^50) peptide. EBNA1 affinity 
columns were constaicted by loading purified GST-EBNA1 peptide proteins to GS4B affinity 
columns (lane 2 and Lane 7). Purified HAUSP N-terminal domain was loaded to both GST- 
EBNA1 (395-422) and GST-(422-450) columns (Lane 1), and was washed extensively (lane 4 
and Lane 9). Protein sample on both columns were eluted off using GSH. The eluate from the 
GST-EBNA1 (395-422) affinity column contained no HAUSP N-termlnal domain protein (Lane 6), 
while a protein band corresponding to HAUSP N-terminal domain was detected in the eluate from 
GST-EBNA1 (422-450) column (lane 1 0). 



The results derived from EBNA1 affinity chromatography were further 
confirmed by gel-filtration chromatography experiment (Figure 3). 
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Figure 3 Gel-fiitration runs for the binding test between HAUSP N-terminal donnain and 
EBNA1 peptides. Left panel shows the gel-filtratlon chromatography of HAUSP (53-206) by itself 
(top), HAUSP / EBNA1 (395-422) mixture (middle), and HAUSP / EBNA1 (422-450) mixture. 
Aliquots of the fractions for each gel-filtration run were visualized by Coomassie staining following 
SDS-PAGE. Due to the lack of positively-charged residues. EBNA1 (422^50) peptide does not 
get stained by Coomassie. 



According to the gel-filtration results, the EBNA1 (residues 395-422) 
peptide does not form a stable complex with the HAUSP N-terminal domain in 
solution, and was eluted separately from HAUSP N-terminal domain on gel- 
filtration. In contrast, the EBNA1 (residues 422-450) can form a stable complex 



with the HAUSP N-terminal domain, and the complex was eluted from gel- 
filtration one fraction earlier that HAUSP N-terminal domain by itself. 

To further delineate the minimal HAUSP-binding sequence in EBNA1 , we 
carried out another round of EBNA1 affinity chromatography to test the HAUSP 
binding ability of a series of deletion variants within the previously identified 
HAUSP binding sequence (residues 422-450). In order to avoid potential steric 
interference from the GST moiety, a flexible linker of 35 amino acids was added 
in between GST and EBNA1 peptide for each GST-fusion EBNA1 construct. 
Aliquots of loading samples, wash and eluates were run on SDS-PAGE (Figure 

4). 
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Figure 4 EBNA1 affinity chromatography. GST-fusion EBNA1 peptides with different length 
were loaded to individual GS4B columns. Purified HAUSP N-terminal domain (53-206) was 
loaded to individual GST-EBNA1 columns (lane 2. lane 5 and lane 8). washed extensively (lane 
3, lane 6, and lane 9), and eluted with GSH (lane 4, tane 7, lane 10). 
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According to the second-round EBNA1 affinity chromatography results, no 
HAUSP N-terminal domain protein was bound to GST-EBNA1 (422-442). 
indicating that this part of the sequence may not be involved in HAUSP binding. 
In contrast, a substantial amount of HAUSP N-terminal domain protein was 
detected in the eluate from GST-EBNA1 (441-450) column, indicating that this is 
the most critical sequence for HAUSP binding. A faint band of HAUSP N-terminal 
domain was also seen in the eluate sample from GST-EBNA1 (446-450). which 
suggests that there is also binding between HAUSP and EBNA1 (446-450). 
However, compared to the binding between HAUSP and EBNA1 (441-450), 
EBNA1 (446-450) binds to HAUSP much less tightly, indicating that further 
truncation after residue 441 may have removed part of HAUSP binding 
sequence. 

Combining the data from both affinity chromatography and gel-filtration, a 
minimal sequence of 10 amino acids in EBNA1 (resides 441-450) has been 
shown to play an essential role in binding to HAUSP N-terminal domain. 

HAUSP recognizes a central acidic domain of I\/Idm2 

The full-length human Mdm2 contains 491 amino acids. According to the 
sequence alignment of Mdm2 from different species, the full-length Mdm2 can be 
divided into four major conserved regions (Figure 5). Region I is at the N 
terminus of Mdm2, containing about 90 amino acids without obvious sequence 
similarity to known domain identities. This region contains the p53-binding 
domain. Region II Is a highly acidic region with 62% sequence homology across 



species. This region has been shown to be involved in the binding with multiple 
proteins, including p14ARF, the inhibitor of Mdm2, and L5, a component of the 
large ribosomal subunit. Region III is a potential zinc finger domain with 58% 
sequence homology. The exact function of this domain still remains unclear. 
Region IV is a highly conserved RING finger domain with 83% sequence 
homology. This domain is responsible for the E3 ubiquitin ligase activity of 
Mdm2. A short basic sequence between region I and region II with a potential 
nuclear localization signal is also highly conserved. 
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Figure 5 Schematic representation of the domain compositions of human Mdm2 protein. 

Full-length Mdm2 protein contains four major domains (I - IV). The boundary residues for each 
domain are Indicated. 

To study the recognition between Mdm2 and HAUSP, we generated a 
series of deletion constructs for both Mdm2 and HAUSP, expressed and purified 
these recombinant proteins and tested their ability to interact with each other, 
using gel filtration. According to gel-filtration results, just as in the case of p53 
interaction assay, the N-terminal domain of HAUSP is both necessary and 
sufficient for stable interactions with Mdm2. Neither the core domain (208-560) 
nor the C-terminal domain is involved in the binding with Mdm2 (Figure 6A). For 
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Mdm2, an 82 amino acid sequence (208-289) in the acidic region is sufficient to 



bind to HAUSP N-terminal domain (Figure 6B). 
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Figure 6 The N-terminal domain of HAUSP recognizes the acidic region sequence of 
IVIdm2. (A) Identification of the N-terminal domain of HAUSP as the primary Mdm2 binding motif. 
Various purified recombinant HAUSP domain constructs were individually incubated with a large 
fragment of Mdm2 (residues 170-423) and then subjected to a gel-filtratlon analysis. The results 
are summarized. (B) Identification of an Mdm2 fragment including the acidic region (residues 208- 
289) that is sufficient for HAUSP recognition. 

As previously demonstrated, the p53 peptide (residues 351-382) can fonm 
a stable complex with the HAUSP N-terminal domain in solution. However, in the 
presence of Mdm2, the binding between p53 and HAUSP was completely 
abolished (Figure 7). In the gel-filtration binding assay for HAUSP, Mdm2 and 
p53, Mdm2 (residues 208-289) efficiently competed off p53 peptide (residues 
351-382), and formed a stable complex with HAUSP N-terminal domain 
(residues 208-289). The currently identified HAUSP recognition motif of Mdm2 
includes the central acidic domain of Mdm2. This central acidic domain of Mdm2 
contains 37% acidic residues in mouse and human. 
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Figure 7 Representative gel-filtration runs for the binding assay between HAUSP and 
Mdm2. Left panel shows the gel-filtration chromatography of HAUSP (1-206) by itself (top), 
HAUSP / Mdm2 (208-289) mixture (middle), and HAUSP / Mdm2 (208-289) / p53 (351-382) 
mixture (bottom). Aliquots of the fractions for each gel-filtration run were visualized by Coomassle 
staining following SDS-PAGE. A representative SDS-PAGE gel for the complex between HAUSP 
and p53 is also shown on bottom right. The HAUSP N-terminal domain (residues 1-206) alone 
was eluted from gel-filtration with the peak fraction at 32 (1^ gel, from top), and was eluted with 
the peak fraction at 29 when complexed with Mdm2 (208-289) (2"^* gel from top). p53 peptide 
(residues 351-382) itself can form a complex with HAUSP N-terminal domain (residues 1-206). 
and the complex was eluted with the peak fraction at 31 (4*^ gel from top). However. In the 
HAUSP (residues 1-206), Mdm2 (residues 208-289) and p53 (351-382) tertiary mixture, the p53- 
HAUSP interaction was disrupted and only Mdm2 and HAUSP formed a complex. 
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It has been reported that this acidic region of Mdm2 also interacts 
specifically with L5, a component of the large ribosomal subunit, and thus is 
implicated in ribosome biosynthesis or In translational regulation. In addition to L5 
binding, the central acidic region of Mdm2 is essential for ARF binding. ARF 
(p14ARF and pi 9 ARF of human and mouse, respectively) is a small basic 
protein encoded by the INK4a locus, which also encodes the cyclin-dependent 
kinase inhibitor p16INK4a. Soon after its identification, ARF was found to interact 
with Mdm2 and to block Mdm2-mediated ubiquitination of p53. A truncated 
human Mdm2 containing residues 208-491 was able to interact with human 
ARF, and deletion of residues 222-437 completely abolished its binding to ARF. 
Further deletion analysis has showed that an acidic domain overlapping region 
(residues 210-244) of human Mdm2 is most critical for the ARF binding activity. 
Our finding that the acidic domain is also involved in the recognition with HAUSP 
adds an additional function to this domain. 

HAUSP N-temninal domain adopts a TRAF-like structure 

In addition to the biochemical analyses of the binding between HAUSP 
and its substrates, a structural approach was used to further study the substrate 
recognition by HAUSP. Given the fact that the N-terminal domain of HAUSP is 
involved in the recognition of three out of the four currently identified HAUSP- 
interactlng proteins, the structure of the HAUSP N-terminal domain alone 
became the first focus. BLAST search results for HAUSP N-terminal domain 
shows that the HAUSP N-terminal domain shares 34% sequence identity with the 
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TRAP domain (TD) of human TRAF2. Tumor necrosis factor (TNF) receptor- 
associated factors (TRAFs) are a family of adapter proteins that were first 
identified for their ability to bind to TNF family receptors (TNFRs). TRAFs can 
regulate the functions of the TNFR superfamily by linking the cytosolic domain of 
those receptors to downstream regulators such as protein kinases or ubiquitin 
ligases. Up to now. six members of the TRAF family have been identified in 
human and mice, and two TRAFs have been identified in Drosophila. All known 
TRAFs share a highly conserved region near their C-terminus, which is referred 
to as the "TRAF domain (TD)". The TRAF domain represents a novel protein fold 
of an anti-parallel p-sandwich structure and it usually consists of a sequence of 
about 180 amino acids. It is well known that TRAFs regulate the function of TNF 
family receptors mainly through binding to the specific peptidyl motifs in the 
cytosolic domains of the receptors. The functional specificity of TRAFs arises 
from the different peptide motifs they recognize. 

To elucidate the mechanism of substrate recognition by HAUSP, we 
crystallized the N-terminal domain of HAUSP (53-206) and determined its 
structure at 1.6 A resolution using multi-wavelength anomalous dispersion (Table 
1). Consistent with sequence analysis, the HAUSP N-terminal domain adopts a 
TRAF-like structure (Figure 8). It fomris a nine-stranded antiparallel p-sandwich. 
with strands pi, p9, p5 and p6 in one sheet and p2. p3, p4, p7 and p8 in the 
other. 

Table 1 . Summary of Crystallographic Analysis 
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The topology and overall structure of HAUSP N-terminal domain closely 
resembles those of the TRAP domain structure. The HAUSP N-termlnal domain 
can be superimposed on the TRAP domain of human TRAF2 structure with an 
rmsd of 1.48 A for 101 aligned backbone Ca atoms. There is a shallow surface 
depression in the middle portion of one side of the p-sandwich stnjcture (Figure 
9). In TRAP domains, this surface crevice is the region responsible for peptide 
recognition. In agreement with this feature, our binding assay results have shown 
that all three substrates known to interact with the HAUSP N-terminal domain 
use a peptidyl motif for the interaction. 
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Figure 8 The overall structure of HAUSP N-terminal domain. The HAUSP N-terminal domain 
adopts a TRAF-like structure. It forms a nine-stranded antiparallel sandwich, with strands p1, P9, 
pS and p6 in one sheet and p2. p3. P4. p7 and p8 in the other. A two-turn helix is present in the 
crossover connection between strands pS and p9. 
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Figure 9 Structural comparison of HAUSP N*terminal domain with the TRAP domain of 
human TRAF2. The HAUSP N-temiinal domain (in pink) can be super-imposed to the TRAP 
domain of human TRAF2 structure (in white) with an rmsd of 1 .48 A for 101 aligned backbone Ca 
atoms. Both proteins adopt an antiparallel fi sandwich structures. 

Discussion 

Since its identification in 1997, HAUSP has been shown to interact with up 
to four proteins, each of which has veiy important regulatory functions in the cell. 
There are multiple reasons to study the recognition mechanism between HAUSP 
and its substrates. One arises from the fact that, to date, the issue of substrate 
specificity of the whole UBP family deubiquitinating enzymes has remained 
elusive and not well studied. Even though there have been hypotheses claiming 
that the divergent sequences at the N or C terminus of UBPs endow high 
substrate specificity, no well-studied example has yet been presented. Being one 
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of the few UBPs with substrates identified, HAUSP makes a very good target for 
further studies of the substrate recognition mechanism of the UBP family 
enzymes. In addition, the fact that ail the identified HAUSP-interacting proteins 
are important regulators in various biological processes makes the study of 
substrate recognition by HAUSP very important. 

Using gel-filtration, we first identified a protease-resistant fragment of 
HAUSP N-terminal domain (residues 53-208) to be both necessary and sufficient 
for p53 binding. In a similar approach, we also identified a C-terminal peptide 
sequence of p53 as necessary and sufficient for binding to the N-terminal p53- 
recognition motif of HAUSP. The minimal peptide (residues 357-382) comprised 
only 26 amino acids (Figure 1), but it contains five of the six putative 
ubiquitination sites. The extensive overlap of these multiple ubiquitination sites 
with the HAUSP binding site imposes important stmctural constraints on HAUSP. 
First, the HAUSP p53-recognition domain must be in close proximity to the active 
site cleft of the catalytic domain. Second, to accommodate these multiple 
ubiquitin attachment sites, the connection between the HAUSP catalytic core 
domain and the p53 binding domain must be flexible in order to position each 
ubiquitin-p53 linkage within the active site for nucleophilic attack by the catalytic 
Cys. This organization is likely to be critical for the efficient deubiquitination of 
p53 in vivo. 

The recognition between HAUSP and EBNA1 has been previously 
characterized. According to literature. EBNA1 binds to the N-terminal domain of 
HAUSP, the same domain where p53 binds. A 46 amino acid sequence 
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(residues 395-450) of EBNA1 has been shown to be both necessary and 
sufficient for HAUSP recognition. In our study, we used a combined approach of 
both EBNA1 affinity chromatography and gel-filtration to further define the 
minimal sequence requirement of EBNA1 for HAUSP binding. Based on our 
results, a short peptide motif of 10 amino acids (residues 441-450) in EBNA1 
plays the most critical role for HAUSP binding, and since further deletion of the 
peptide severely reduced the binding affinity between EBNA1 and HAUSP, it 
could be the minimal sequence required for HAUSP recognition. The fact that 
EBNA1 and p53 interact with the same domain of HAUSP raised the possibility 
that EBNA1 may interfere with the binding between p53 and HAUSP, and indeed 
this is the case. EBNA1 binds to the N-tenminal domain of HAUSP almost 10 
times stronger than p53, and the HAUSP-EBNA1 complex fomiation strongly 
inhibited the ability of p53 to bind HAUSP. The recognition between p53 and 
HAUSP has been shown to play an important role in stabilizing p53, and hence 
inhibiting cell cycle progression and inducing apoptosis. The fact that EBNA1 and 
p53 interact with the same domain of HAUSP and EBNA1 recognition strongly 
inhibits p53 binding raised the possibility that EBNA1 may interfere with cellular 
processes in host cells by disrupting the HAUSP-dependent deubiquitination of 
p53. By destabilizing p53, EBNA1 would be expected to promote cell cycle 
progression and prevent apoptosis, which could be important for the host cell 
immortalization. Epstein-Barr virus efficiently immortalizes cells during its latent 
infectious cycle, and this process involves a few different Epstein-Barr virus 
latency proteins. It still remains unclear whether EBNA1 plays a direct role in host 
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cell immortalization, but the findings that transgenic mice expressing EBNA1 
have a tendency to develop B-cell lymphomas suggest that EBNA1 may play a 
role in those processes. It still remains as a hypothesis that EBNA1 can 
contribute to host cell immortalization by Epstein-Barr virus through sequestering 
HAUSP, thereby destabilizing p53. Further studies will be needed to give a 
clearer answer. 

Beside p53 and EBNA1 . we also identified that the N-terminal domain of 
HAUSP is responsible for Mdm2 binding. Due to the difficulty of expressing full- 
length Mdm2. a long fragment of Mdm2 (residues 170-432) was used to identify 
the Mdm2-binding region in HAUSP. By gel-filtration assays, we showed that 
neither the catalytic core domain nor the C-terminal is able to bind to the long 
fragment of Mdm2. The HAUSP-binding region of Mdm2 was localized to the 
central acidic region (residues 208-289). Our binding assay results also showed 
that, like EBNA1 , Mdm2 can efficiently compete off p53 peptide from binding to 
HAUSP. It has been previously demonstrated that HAUSP can efficiently 
deubiquitinate p53 and Mdm2, both in vivo and in vitro. Based on these findings, 
a dynamic role of HAUSP in the p53-Mdm2 pathway was proposed, which claims 
that a kind of substrate balance for HAUSP is achieved under physiological 
conditions: under certain conditions, HAUSP may exclusively deubiquitinate p53, 
while under other conditions, Mdm2 may be the major substrate for HAUSP. Our 
finding that Mdm2 can form a stable complex with HAUSP, and that this complex 
formation efficiently abolishes p53 binding to HAUSP, strongly argues that in vitro 
Mdm2 is a preferred substrate for HAUSP. It has been shown that Mdm2 uses 



-23- 



multiple mechanisms to regulate p53 functions, including targeting p53 for 
ubiquitination and sequestering p53 transactivation activity. Our findings also 
raised a possibility that in addition to the previously identified regulatory 
mechanisms, Mdm2 may also be able to destabilize p53 through sequestering 
HAUSP's deubiquitinating activity on p53. 

The HAUSP-binding motif that we identified in Mdm2 includes a sequence 
motif (residues 210-244) that has been shown to be essential for binding to ARF. 
ARF protein is previously shown to be a Mdm2 negative regulator. ARF can 
function to stabilize p53 by blocking the E3 ubiquitin ligase activity of Mdm2. Our 
finding that the ARF binding motif in Mdm2 overlaps the HAUSP-binding region 
raised the interesting possibility of an interplay network between Mdm2, HAUSP, 
and ARF. Further characterization of the interactions between Mdm2 and 
HAUSP and between Md.m2 and ARF will help to clarify the real function of 
HAUSP in the ARF-Mdm2 pathway. 

As a further effort to study the substrate recognition mechanism of the N- 
tenminal domain of HAUSP, the crystal structure of the HAUSP N-terminal 
domain was solved. The HAUSP N-terminal domain adopts a nine-stranded 
antiparallel p-sandwich structure, which closely resembles the TRAF domain 
(TD) architecture. In the HAUSP N-terminal structure, there is a shallow surface 
concave at one side of the p-sandwich. In TRAF structures, a similar surface 
depression is involved in the recognition of specific peptides. This feature 
strongly supports our conclusion from biochemical binding assays that the nature 
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of the recx)gnition between HAUSP and its substrates is likely to be peptide- 
protein interaction. 
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Summary 

The ubiquitin-specific processing protease (UBP) fam- 
ily of deubiquitinating enzymes plays an essential role 
In numerous cellular processes. HAUSP, a representa- 
tive UBP, specifically deubiquitinates and hence stabi- 
lizes the tumor suppressor protein p53. Here, we re- 
port the crystal structures of the 40 kDa catalytic core 
domain of HAUSP in isolation and In complex with 
ubiquitin aldehyde. These studies reveal that the UBP 
deubiquitinating enzymes exhibit a conserved three- 
domain architecture, comprising Fingers, Palm, and 
Thumb. The leaving ubiquitin moiety Is specifically co- 
ordinated by the Fingers, with Its C terminus placed 
in the active site between the Palm and the Thumb. 
Binding by ubiquitin aldehyde induces a drastic con- 
formational change in the active site that realigns the 
catalytic triad residues for catalysis. 

Introduction 

In eukaryotes, protein ubiquitination is an essential step 
in the physiological regulation of many cellular pro- 
cesses, such as elimination of damaged or misfolded 
proteins, cell cycle progression, and signal transduction 
(Conaway et al., 2002; Glickman and Ciechanover, 2002; 
Hershko et al., 2000; Hochstrasser, 1996; Laney and 
Hochstrasser, 1999; Pickart, 2001). Ubiquitin is a highly 
conserved 76 amino acid polypeptide. Ubiquitin is joined 
to proteins by an isopeptide bond between the C-ter- 
minal carboxylate group of ubiquitin and the lysine 
€-amino group of the acceptor protein. This process 
depends on the action of three classes of enzymes 
known as ubiquttin-activating ertzyme (El), ubiquittn- 
conjugating ena:ymes (E2), and ubiquitin ligase (E3). 
Conjugation by ubiquitin is critical for the control of 
many key regulatory proteins. Accordingly, ubiquitina- 
tion is itself tightly regulated, and aberrations in this 
pathway are known to lead to a variety of clinical disor- 
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ders (Chung et al., 2001; Schwartz and Ciechanover, 
1999). 

A major function of ubiquitination is to target proteins 
for degradation by the 26S proteasome. However, ubi- 
quitlnated proteins are not always destroyed by the pro- 
teasome. In recent years, protein deubiquitination has 
emerged as an important regulatory step in the ublqui- 
tin-dependent pathways (Chung and Baek, 1999; D'An* 
drea and Pellman. 1 998; Hochstrasser, 1996; Wilkinson, 
1 997). Deubiquitination is carried out by the deubiquiti- 
nating enzymes (DUBs), which comprise two major 
groups: the ubiquitin C- terminal hydrolase (UCH) family 
and the ubiquitin-specific processing protease (UBP) 
family. Members of the UCH family are usually of small 
size ('^20-^0 kDa) and most preferentially cleave ubiqui- 
tin from small peptides and amino acids. Unlike the 
UCHs, the much larger UBPs (60-300 kDa) generally 
appear to be specific for the proteins targeted for ubiqui- 
tination. 

The human genome sequencing project led to the 
identification of more than 90 potential deubiquitinating 
enzymes, making them one of the largest classes of 
enzymes in the ubiquitin system. Greater than 60% of 
these OUBs belong to the UBP class, which appears to 
regulate a diverse set of biological processes (Chung 
and Baek, 1999; Hochstrasser, 1996; Wilkinson. 1997). 
Unlike the highly conserved UCHs, the UBPs contain 
highly divergent sequences and exhibit strong homol- 
ogy mainly in two regions that surround the catalytic 
Cys and His residues; these are the so-called Cys Box 
(<^19 amino acids) and the His Box (60-90 amino acids). 
Although representative UCHs have been structurally 
characterized (Johnston et al., 1997; Johnston et at., 
1999), there is a lack of structural information on the 
UBPs. Consequently, t>oth the catalytic mechanism and 
the regulation of the UBP family of DUBs remain unclear. 

The p53 tumor suppressor protein is a sequence-spe- 
cific transcription factor that can respond to a wide 
variety of cellular stress signals (Levine, 1 997; Vogelstein 
et al., 2000). In an affinity-based approach using GST- 
p53 as a bait, the cellular protein HAUSP (herpesvirus- 
associated ubiquitin-specific protease, also known as 
USP7) was recently identified as a novel p53-interacting 
protein (U et al., 2002). HAUSP specifically deubiquiti- 
nates the ubiquitinated p53 protein both in vitro and in 
vivo, and the expression of HAUSP was found to stabi- 
lize p53 in vivo and to promote p53-dependent cell 
growth arrest and apoptosis (U et al., 2002). These find- 
ings reveeU an important and novel mechanism in which 
p53 degradation is prevented by direct deubiquitination 
and imply that HAUSP might function as a tumor sup- 
pressor in vivo through the stabilization of p53. HAUSP 
is a mammalian UBP for which a specific substrate has 
been identified. 

Here, we report the crystal structures of the 40 kDa 
catalytic domain of HAUSP in isolation and in a complex 
with ubiquitin aldehyde. We discuss the structural in- 
sights into the catalytic mechanisms as well as the regu- 
lation of the UBP family of DUBs. We also present bio- 
chemical evidence on how HAUSP recognizes p53. 
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Results 

Structure of the Catalytic Core Domain 
The full-length HAUSP protein (residues 1-1102) was 
expressed in baculovirus-lnfected Insect cells and was 
purified to homogeneity (see Experimental Procedures). 
HAUSP exists as a monomer in solution as judged by 
gel filtration and is functional as evidenced by its ability 
to recognize and deubiquitinate p53 (data not shown). 
To define the domain boundaries in HAUSP, we em- 
ployed an approach combining limited proteolysis by 
subtilisin, sequence alignment with other UBP family 
members, and binding and deubiquitination assays. This 
characterization identified a 40 kDa fragment of HAUSP 
(residues 208-560) as the catalytic core domain, which 
mediates ubiquitin binding and deubiquitination of suk>- 
strate. 

To gain functional insights into the UBPs, we crystal- 
lized the catalytic core domain and determined its struc- 
ture at 2.3 k resolution using multi-wavelength anoma- 
lous dispersion (Table 1 and Figure 1A). There 3TB two 
molecules of HAUSP In each asymmetric unit; they ex- 
hibit an identical set of stmctural features and can be 
superimposed with a root-mean-square deviation 
(rmsd) of 0.6 A for all Ca atoms. Thus, we focus our 
discussion only on one HAUSP molecule (Rgures IB 
and 2). 



The HAUSP catalytic core domain, with a dimension 
of 75 A X 45 A X 40 A. resembles an extended right 
hand comprised of three domains, Fingers, Palm, and 
Thumb (Figure 1 B). The Thumb consists of eight a heli- 
ces (a1-a6, a9, and alO), with the N-terminal Cys Box 
adopting an extended confomnation. The Palm contains 
eight central p strands 03. p8-pi4), which are but- 
tressed by two a helices (a7 and a8) and several surface 
loops. An anti-parallel p sheet 08, pi 0-01 4), fornied by 
six of the eight p strands from the Palm, intimately packs 
against the globular Thumb and gives rise to an inter- 
domain deep cleft The Cys Box and the His Box are 
positioned on the opposing sides of this cleft (Figure 
1 B). The Fingers, comprised of four p strands in the 
center (pi , p2, P4, and p7) and two at the tip (P5 and 
P6), reach out more than 30 A from the Palm-Thumb 
scaffold. 

The three-domain structure of HAUSP creates a prom- 
inent binding surface between the tip of the Fingers and 
the Palm-Thumb scaffold. The size and shape of this 
binding pocket appear to be Ideal for the 8 kDa protein 
ubiquitin (Figure 1 B). Moreover, an orientation for the C 
terminus of a bound ubiquitin is implied by the connec- 
tion of this binding surface with the deep catalytic cleft 
between the Palm and the Thumb (Figure 1 B, right im- 
age). This region of the HAUSP surface is enriched with 
acidic amino acids. 
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Figure 1 . Structure of the Catalytic Core Domain of H AUSP 

(A) A representative portion of the experimental electron density map in stereo view. The map, contoured at 1 .5 (7 at 2.3 A resolution, is shown 
around the active site region. Some Important residues are labeled. 

(B) Overall structure of the 40 kDa catalytic core domain of HAUSP. The structure comprises three domains. Fingers (colored green), Palm 
(blue), and Thumb (gold). The active site, eompiising the Cys Box (cyan) and the His Box (purple), is located between the Palm and the Thumb. 
The surface, represented by electrostatic potential, is shown on the right A deep cleft runs through the Palm and the Thumb. The predicted 
ubiquitin binding site is indicated by a black oval circle. 

(C) A stereo view of the active site between the Palm and the Thumb. Note that the catalytic residue Cys223 is nearly 1 0 A away from His464, 
which was thought to activate Cys223 through deprotonation. Ail figures were prepared using MOLSCRIPT (Kraulis, 1991) and GRASP (Ntcholls 
et al., 1991). 



To investigate whether the three-domain architecture 
(Fingers-Palm-Thumb) of HAUSP is conserved among 
other UBP members, we aligned the primary sequences 
of HAUSP and six representative UBP proteins (Figure 
2). The sequence alignment result was crossvalidated 
with structural information on HAUSP. This analysis re- 
vealed that the secondary structurEU elements In the 
three domains are generaJly comprised of conserved 



residues (Figure 2). For example, Lys391, on the |J7 
strand of the Fingers, is invariant among all aligned 
UBPs. More importantly, residues that contribute to the 
structural integrity of the Fingers, the Palm, and the 
Thumb are highly conserved. Notably, the limited hy- 
drophobic core at the tip of the Fingers is formed by the 
side chains of Tyr379 on strand ^5, Ile332 and Cys334 on 
strand pi, and Leu373 on strand p4. Each of these four 
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Figure 2. Sequence Alignment of HAUSP with Six Representative UBP Family Proteins 

Conserved residues are shaded in yellow whereas the catalytic triad is highlighted in red. Residues that are involved in direct inter-molecular 
hydrogen borxl interactions using their side chains and main chains are marked with purple and green arrows, respectively. Residues that 
are involved in van der Waals contact with Ubal are labeled with blue squares. Residues that coordinate the oxyanton through hydrogen 
bonds are identified with blue triangles above the alignment The secondary structural elements above the sequences are Indicated for the 
free HAUSP (lower) and the ubtquitin-bound HAUSP (upper), respectively. The coloring scheme for the secondary structural elements of free 
HAUSP Is the same as in Figure 1. Sequence allgrunent employed the programs CtustalW (Thompson et al., 1994) and Block Maker (Henikoff 
et aU 1995). Based on structural similarity, relevant sequences of the UCH protein Yuhl are also shown below the alignment. Entries 
shown are from the SwIssProt Database: HAUSP (Human; SWK^009): UBP1 5 (SaccAarom/ces cerevtsiae; SW:P50101): UBP8 (S. cerews/ae; 
SW:P50102); DOA4 (S. cerevtetoe; SW:P32571): FAF {prosofihila melanogaster, SW:P55824): UBP5 (Human; SW:P45974); USP14 (Human; 
SW:PS4578}; and YUHl (S. cereWstoe; SW:P35127). 
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residues is preserved in at least four other UBP proteins 
and replaced by a compatible residue In the rest (Figure 
2). These observations strongly suggest that the Fin- 
gers-Palm-Thumb architecture of HAUSP is conserved 
among other UBP proteins. 

Misaligned Active Site of HAUSP 
All members of the UBP family of DUBs share strong 
homology in the Cys and His Boxes. The Cys Box con- 
tains the catalytic cysteine residue, which is thought to 
undergo deprotonatton and to unleash a nucteophilic 
attack on the cart>onyl carbon atom of the ubiqultin 
Gly76 at the scissile peptide bond. In analogy with other 
cysteine proteases, the deprotonation of this cysteine 
residue most likely is assisted by an adjacent His resi- 
due, which, in turn, is stabilized by a nearby side chain 
from an Asn or Asp residue. Together, these three resi- 
dues constitute the so-called catalytic triad. Previous 
mutagenesis studies on several UBPs have provided 
evidence that these residues have critical roles In cataly- 
sis (e.g., Baek et al.. 2001; Huang et al., 1995; Gilchrist 
and Baker, 2000). 

The highly conserved Cys and His Boxes are posi- 
tioned on opposite sides of the catalytic cleft (Figure 
1 C). Although Cys223 appears to be the nucleophile in 
HAUSP, the identities of the other two catalytic residues 
remain uncertain. His464, a candidate catalytic residue, 
donates a hydrogen bond to Asp4B1 (Figure 1 C); thus, 
this His-Asp pair likely constitutes two residues in the 
catalytic triad. However, the closest distance from the 
side chain of His464 to that of Cys223 is 9.7 A, too 
far away to allow any meaningful interactions. Another 
invariant histidine residue in the sequence sUignment 
(Figure 2), His456, is separated approximately 7.5 A 
away from Cys223 by an intervening residue, Tyr465 
(Rgure 1C). Moreover, the sulfur atom of Cys223 Is not 
within a reasonable distance of any charged or polar 
side chain to allow direct interactions. These structural 
observations indicate that the active site of the free 
HAUSP core domain exists in an unproductive confor- 
mation, and that substrate binding will likely trigger a 
conformational change that results in catalysis. 

Overall Structure of a HAUSP-Ubal Complex 
Ubiqultin aldehyde (Ubal). a ubiquitin derivative in which 
the C-terminal cartioxylate Is replaced by an aldehyde, 
is a potent inhibitor of most DUBs as it forms a thiohemi- 
acetal with the catalytic Cys, mimicking a reaction inter- 
mediate (Hershko and Rose, 1 987; Johnston et al., 1 999). 
To elucidate the catalytic mechanisms of the UBPs, we 
prepared Ubal and reconstituted a covalent complex 
between HAUSP and the inhibitor. We crystallized this 
binary complex and solved its structure at 2.3 A resolu- 
tion by nrK>lecular replacement (T able 1 ). Each asymmet- 
ric unit contains two HAUSP-Ubal complexes; they are 
nearly identical with a rmsd of 0.53 A for 41 9 Ca atoms. 
For simplicity, we focus our discussion on one such 
complex (Rgure 3A). 

As anticipated, Ubal binds to the putative substrate 
binding surface of HAUSP and makes extensive con- 
tacts with both the Rngers and the Palm-Thumb scaffold 
(Rgure 3A). The Ubal C temninus Is bound in the deep 
catalytic cleft between the Palm and the Thumb, with a 



thiohemiacetal linkage formed between the aldehyde 
group and the side chain of Cys223. Accompanying the 
complex formation, two previously flexible surface loops 
of HAUSP become ordered, with one of these directly 
contributing to the coordination of Ubal (Figure 3A). 

Binding by Ubal induces several prominent conforma- 
tional changes in the catalytic core domain, resulting in 
a rmsd of 1 .3 A for 320 aligned Ca atoms between the 
free and Ubal-bound HAUSP structures (Figure 3B). 
These changes are particulariy dramatic in the immedi- 
ate vicinity of the catalytic cleft, where the main chain 
loops between a4 and a5 and between ^10 and p11 
move 5-^ A (Rgure 3B, gold arrows). Interestingly, Ubal 
binding also draws the tip of the Fingers 05) and the 
edge of the Thumb (a5) closer by 2-3 A (Figure 3B, black 
arrows); this is likely due to the fact that Ubal makes 
direct interactions primarily with the tip of the Fingers 
and the catalytic cleft between the Palm and the Thumb 
(Figure 3C). 

The HAUSP-Ubal interactions result in the burial of 
3600 A^ exposed surface area (Figure 3C), of which 30% 
comes from the interface between Ubal and the tip of 
the Fingers. The rest is derived from the interface be- 
tween the C-termina) portion of Ubal and its surrounding 
HAUSP elements, particularly the catalytic cleft. Intrigu- 
ingly, although some basic residues of Ubal directly 
interact with the acidic surface of HAUSP, much of the 
buried surface area on Ubal is uncharged. How can the 
uncharged Ubal surface interact with the predominantly 
acidic surface of HAUSP? Examination of the structure 
reveals that a layer of ordered water molecules are sand- 
wiched between Ubal and the middle portion of the 
Rngers (Rgure 3D). These water molecules form exten- 
sive networks of hydrogen bonds among themselves as 
well as to the side chain and main chain atoms of HAUSP 
and Ubal. Thus, the binding of Ubal by HAUSP can be 
visualized as grabbing Ubal with the tip of the Fingers 
and the catalytic cleft between the Palm and the Thumb, 
with cushioning waters in between. 

These trapped water molecules, absent in the struc- 
ture of the free HAUSP, exhibit low temperature factors 
in the complex and might be expected to abate the 
affinity of HAUSP for ubiquitin and, as a consequence, 
may enhance the selectivity for ubiquitinated p53 over 
other ubiquitin conjugates. In support of this analysis, 
ubiquitin does not form a stable complex with HAUSP 
as judged by gel filtration. Furthermore, high Km values 
were observed in deubiquitinatlon assays with three dif- 
ferent substrates (Km s 25 jiM for ubiquitin-AMC, and 
Km > 50 M,M for ubIquitin-OM or K48-linked diubiquitin; 
data not shown). These results are in marked contrast 
to the low micromolar and submicromolar Km values 
observed for several UCHs (Larsen et al., 1996; Lam et 
al., 1997a; Dang et al., 1998; Johnston et al.. 1999). 

Realignment of the Active Site Is Induced 
by Ubiquitin Binding 

The active site of free HAUSP exists in an unproductive 
conformation (Figure 1 C). Upon Ubal binding, structural 
elements surrounding the catalytic cleft undergo dra- 
matic changes that realign the active site residues for 
productive catalysis (Figure 4A). Compared to the free 
HAUSP, Cys223 and His464 shift over a distance of 4.8 
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Figure 3. Overall Structure of the HAUSP-Ubal Complex 

(A) Overall structure of the catalytic core domain of HAUSP (208-560, blue) covalently bound to Ubal (green). The previously disordered regions 
In the free HAUSP are highlighted in red. The catalytic triad residues are shown as yellow sticlcs. 

(B) Superposition of structures of HAUSP in isolation (purple) and In complex with Ubat (blue). Red and gold arrows indicate regions that 
become ordered and that undergo drastic conformational changes upon binding to Ubal, respectively. 

(C) Schematic diagram of how ubiquitin is bound by the HAUSP catalytic core domain. After binding, Interactions between ubiquitin and 
HAUSP occur primarily at the opposing ends of the Fingers, as Indicated by red circles. Color choices are: blue, the tip of the Flngere; cyan, 
Cys box; purple. Hb box; yellow, Ubal C terminus; and green, Ubal surface that interacts with the tip of ttie Fingers. 

(D) Location of buried water molecules. HAUSP is represented as a surface mesh while Ubal is shown as a green coil. A total of 17 buried 
water molecules are sandwiched between the middle portion of the Fingers and the Ubal. These water molecules mediate numerous hydrogen 
bonds behiveen HAUSP and Ubal. 



and 2.4 A, respectively, toward the bound C terminus 
of Ubal (Figure 4A). Consequently, the N^^ atom In the 
imidazole ring of His464 Is now 3.6 A away from the 
atom in the side chain of Cys223, close to a hydrogen 
bond distance. The stabilizing hydrogen bond between 
His464 and Asp481 is preserved (Rgure 4A). Although 
the conformational switch involves a distance change 
of more than 8 A for regions of the catalytic cleft, there 
is no disruption of the overall structure and all of the 



secondary structural elements are maintained in the Fin- 
gers, the Palm, and the TTiumb (Figure 2). This remarlt- 
able conformational flexibility may be essential to the 
specific deubiquitination activity of the UBPs. 

Another catalytic feature Is the formation of the oxy- 
anion hole, which refers to the accommodation of the 
negative potential formed on the carbonyl oxygen atom 
at the scissile bond. Typically, the oxyanion is stabilized 
by hydrogen bonds from the backbone amide group of 
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Figure 4. Specific Interactions between HAUSP and Ubal 

(A) A large conformational change at the active site Induced by Ubal binding. The active sites of HAUSP in isolation (purple) and in complex 
with Ubal (blue) are superimposed and shown in stereo. The C-terminal tail of Ubal is shown In green. The catalytic triad residues and Asn218 
are shown. Note the dramatic conformattonaJ changes on all three catalytic residues, which realign these residues for productive catalysis. 
Hydrogen bonds are represented by red dashed lines. 

(B) A stereo view of the hydrogen bonds between HAUSP and the C terminus of Ubal. A water molecule, marked by the letter w, likely plays 
an important role by hydrogen kionding to the oxyanion and two surrounding residues (Asn216 and Asp482). 

(C) A stereo view of the van der Waals Interactiofts between HAUSP and the C terminus of Ubal. The skie chains of the Ubal C-terminal six 
residues as well as several critical HAUSP residues are shown. 

(D) A stereo view of the interactions between the tip of the Fingers In HAUSP and UbaL All HAUSP residues shown in (A-0) are highly conserved 
among all members of UBPs. 

(E) Deubiquftlnation activity of mutam HAUSP proteins. The substrate used in this assay is Lys48-linked diubiquitin. The role of the affected 
residue b briefly indicated. HHAA represents the double mutation H456A/H464A. Mutation of any catatytically important residue leads to 
complete abolishment of the deubiquitination activity. 
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the catalytic cysteine as well as from a neighboring Gin 
or Asn. it was previously unclear how the oxyanion is 
coordinated in UBPs. In the HAUSP-Ubal complex, the 
oxyanion that normally would form during catalysis is 
mimiclced by the hydroxyl of the thiohemiacetal (i.e., the 
oxygen from Ubal residue 76). This group can accept 
two hydrogen bonds and is within hydrogen bond dis- 
tance of three potential donors, the backbone amide of 
Cys223 and Thr222 and the side chain of Asn21 8 (Figure 
4B). In addition, a nearby water molecule, supported by 
two direct hydrogen bonds from Asp482 and Asn21 8, 
further neutralizes the negative potential of the oxyanion 
(Figure 4B}. This network of hydrogen bonds is further 
buttressed by an additional intra-molecular contact be- 
tween the side chains of Asp482 and Asn226. 

Significantly, residues that are involved in the coordi- 
nation of the oxyanion are highly conserved among all 
members of the UBPs (Rgure 2). For example, Asn21 8 
and Asp482 are invariant among the seven representa- 
tive U BP proteins whereas /Vsn226 is present in six mem- 
bers and is replaced by Ser in UBP8 (Figure 2). This 
analysis suggests a conserved mechanism for all UBPs 
in the formation of the oxyanion hole and catalysis. The 
active site geometry of the Ubal-bound HAUSP closely 
resembles that of the papain family of cysteine prote- 
ases (Rawlings and Barrett, 1994). 

Specific Interaction between HAUSP and Ubal 
Ubal primarily contacts two regions of HAUSP, the tip 
of the Fingers and the catalytic cleft region between the 
Palm and the Thumb (Figure 3C). The C terminus of Ubal 
is primarily coordinated by a network of 1 2 hydrogen 
bonds between the main chain groups of Ubal and the 
main chain and side chain atoms of HAUSP (Figure 4B). 
Notably, all backbone carbonyl carbon and amide nitro- 
gen atoms in the C-termlnal five residues of Ubal are 
involved In direct hydrogen bonds to neighboring resi- 
dues in HAUSP (Figure 4B). 

In addition to hydrogen bonds, inter-molecular van 
der Waals contacts also contribute to the specific recog- 
nition of the C terminus of Ubal. The two hydrophobic 
residues In the C terminus of Ubal, Leu71 and Leu73, 
point into the catalytic cleft by making direct van der 
Waals contacts to the surrounding HAUSP residues 
(Phe409 and Tyr411 on the newfy formed strand pO, 
Lys420 on strand ^0', and TyrSI 4 on strand |S1 4) (Figures 
2 and 4C). In contrast, the two positively charged resi- 
dues in the C terminus of Ubal, Arg72 and Arg74, point 
up into the open space away from the cleft. Only Arg72 
hydrogen bonds to Glu298 of HAUSP. The last two resi- 
dues of Ubal, Gly75 and Gfy76, fit in the narrowest region 
of the catalytic cleft between the Palm and the Thumb. 
The space surrounding the backbone Cot atoms of these 
two glycine residues Is Insufficient to accommodate any 
other side chain, consistent with the specific function 
of the UBPs. 

On the other side of the HAUSP-Ubal interfiace, resi- 
dues at the tip of the Fingers also make important contri- 
butions to the binding of Ubal. These interactions, in- 
cluding 12 direct hydrogen bonds and one patch of 
van der Waals contacts (Figure 2), primarily invoh/e the 
N-terminal residues of Ubal as well as a few amino acids 
In the middle stretch. Gln2. Lys46, Glu64, and Thr66 



make two hydrogen bonds each to the HAUSP residues 
Lys378 and Asp380, AspSOS and Glu308. Ser341 and 
Tyr379, and Gtu345 and Lys391 , respectively (Figures 2 
and 4D). In this interface, the only van der Waals interac- 
tions occur between Phe4 of Ubal and a hydrophobic 
pocket formed by Ile332, Leu373, Tyr379, and the ali- 
phatic portion of the side chain of Lys391 in HAUSP 
(Figure 4D). 

All of the important HAUSP residues responsible for 
binding to Ubal are conserved among the seven aligned 
UBP members (Figure 2). Out of a total of 25 invariant 
residues among these proteins, 14 contribute directly 
to the specific recognition of ubiquitin. For example, 
Asp295 and Glu298 on helix a5, His456 on strand ^10, 
and Tyr465 on strand 311 , all invariant among the seven 
UBP proteins (Figure 2), make direct hydrogen bonds to 
coordinate the C terminus of Ut>al. Lys391 , an invariant 
residue on strand ^7 of the Fingers, mediates both direct 
hydrogen bond (to Thr66) as well as van der Waals con- 
tacts (to Phe4 of Ubal). The invariant Phe409 and highly 
conserved Arg408 form part of the hydrophobic pocket 
to hold Leu71 and Leu73 of Ubal. This analysis solidifies 
the notion that all UBPs share conserved mechanisms 
of ubiquitin binding and catalysis. In addition, these ob- 
servations further support the generality of the observed 
Rngers-Palm-Thumb architecture of the UBPs. 

Mutational Analysis 

To corrot>orate our structural studies, we assayed deu- 
blquitlnatlon in vitro using Lys48-iinked diubiquitin as 
the substrate. Quantitative assays (see Experimental 
Procedures) with subsaturating substrate O-^., [S] << 
Km) established that the HAUSP catalytic domain and 
full-length HAUSP cleave diubiquitin with similar rates 
(kea/Km = 13 and 1 8 min"' mM"\ respectively). To probe 
the functional significance of the residues in the catalytic 
cleft, we performed alanlne-scanning mutagenesis on 
29 amino acids in the Palm and the Thumb domains of 
the HAUSP catalytic domain. All 29 mutant proteins were 
purified to homogeneity and were individually examined 
for their activity in a semiquantitative In vitro assay; 
some relevant results are shown (Figure 4E). As antici- 
pated, whereas the wild- type enzyme was able to reduce 
the substrate to mono-ubiqultin, mutation of any of the 
catalytic triad residues (Cys223. His464, and Asp481) 
or residues that comprise the oxyanion hole (Asn21 8, 
Asn226, and Asp482) resulted in an undetectable level 
of catalytic activity (Rgure 4E). In addition, mutation of 
His456 or Asp295, which hydrogen bonds to the car- 
bonyl of Arg74 or the amide of Leu73 of Ubal, respec- 
tively, also led to undetectable activity. In contrast, mu- 
tation of any of the four solvent-exposed residues 
(Asp289, Gln293, Asp483, and TyrSI 4) had little effect 
on the deubiquitination activity of HAUSP (Figure 4E). 
Two residues, Gtn230 and Tyr224, contribute to the 
structural integrity of HAUSP by making contacts to 
surrounding residues; their mutations also resulted In 
loss of activity (Figure 4E). These results are in complete 
agreement with our structural analysis. 

Structural Comparison with UCH and ULP 

To reveal common features of catalysis and substrate 

binding, we superimposed the structure of the HAUSP- 
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Figure 5. Structural Comparison of HAUSP- 
Ubal with Yuhl-Ubal and Ulp-Smt3 Com- 
plexes 

(A) Superposition of the structure of the 
HAUSP-Ubal complex with that of Yuhl -Ubal. 
These two stnictures are aligned on their deu- 
biqultinatlon domains. HAUSP and Yuhl are 
shown in blue and purple, respectively- The 
HAUSP- and Yuh1-£tssoclated Ubal moietjes 
are represented as transparent surfaces col- 
ored green and yellow* respectively. Two per- 
pendicular views are shown. 

(B) Stereo comparison of the active sites of 
HAUSP (blue) and Yuhl (purple). The C-ter- 
minal tails of Ubal is shown In green and yel- 
low for the HAUSP and Yuhl complexes, re- 
spectively. Catalytic triad residues and the 
oxyanion-coordinating residue are shown. 
Hydrogen bonds are represented by red 
dashed lines. The catalytic triad residues are 
superimposed with a rmsd of 0.23 A. 

(C) Stereo comparison of the active sites of 
HAUSP (blue) and Ulpl (gold). The C-terminal 
tails of Ubal and Smt3 (SUMO homotog in 
yeasQ are shown in green and yellow, respec- 
tively. Catalytic triad residues and the oxy- 
anlon-coordlrurting residue are shown. 
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Ubal complex onto that of the Yuhl-Ubai complex 
(Johnston et al., 1 999). This alignment resulted in a rmsd 
of 1 .87 A for 96 similar backbone Ca atoms of the deubi- 
quitination domain, with 1 2% sequence identity (Figures 
5A and 2). Both the topology and the overall structure 
of HAUSP are quite different from that of Yuhl , a mem- 
ber of the UCH family of OUBs. Due to a different topol- 
ogy, sequences of the Yuhl protein are permuted in this 
alignment (Figure 2). Yuhl lacks the Fingers domain. 
In addition, the 8-a helix Thumb domain of HAUSP is 
reduced to a tight 5-helicaJ bundle in Yuhl (Rgure 5A). 
Although the Palm domain of HAUSP maintains the 
same overall fold as the corresponding regions In Yuhl , 



large deviations are apparent throughout the structure 
(Figure 5A). 

Recognition of Ubal is also quite different between 
HAUSP and Yuhl . Due to the lack of the Fingers, Yuhl 
h£is only one contiguous Interface that contacts Ubal. 
Although the side chain of Arg74 anchors the C terminus 
of Ubal by making a total of six hydrogen bonds to Yuhl , 
the same group has no interaction with surrounding 
residues In HAUSP. In contrast, although Ubal LeuTI 
contributes little to Ubal-Yuhl interactions, this residue 
makes multiple van der Waals contacts to two aromatic 
residues in HAUSP, Phe409 and Tyr 41 1 . 

Despite divergent structures, HAUSP and Yuhl ex- 
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hibit a nearly identical geometry at the active site (Figure 
56). The catalytic triad residues of HAUSP can t>e super- 
imposed with those of Yuhl with a rmsd of 0.23 A. The 
oxyanion is aJso partially stabilized by similar residues 
in the same general area, Asn2l8 in HAUSP and GlnB4 
in Yuhl . This analysis indicates that hydrolysis of bound 
substrates by the UCHs and the UBPs is likely to employ 
the same mechanism. However, important differences 
in the active site geometry exist prior to substrate bind- 
ing. For Yuhl , the active site residues are unlikely to 
undergo any significant conformational change during 
catalysis as the Ubal-bound conformation is nearty iden- 
tical to that in the unliganded structure of another UCH 
member, UCH-L3 (Johnston et al.. 1997). In the case of 
HAUSP. a dramatic structural shift involving the active 
site residues takes place upon Ubal binding (Rgure 4A). 

The surprisingly large differences between the struc- 
tures of the UBPs and the UCHs may be related to their 
functional differences in biology. For example, the UCHs 
contain an active site crossover loop that does not allow 
passage of large substrates such as folded proteins 
(Johnston et al., 1999). This structural feature indicates 
that, if unassisted, the UCHs are unlikely to deubiquiti- 
nate proteins. On the other hand, the open-cleft struc- 
ture of the HAUSP catalytic domain and the unique 
three-domain architecture are ideal for the deubiquitina- 
tion of large substrates such as a polyubtquitin chain or 
ubiquitinated protein. 

In addition to ubiquitination, conjugation of the small 
ubiquitin-related modifier (SUMO) to proteins also regu- 
lates numerous cellular processes (Hochstrasser, 1998). 
The crystal structure of an ubiquitin-like protein specific 
protease (ULP), Ulpl , which catalyzes the deconjugation 
of SUMO (Smt3 In yeast) from target proteins, was re- 
ported in a covalent complex with Smt3 (Mossessova 
and Lima, 2000). Although the overall structure of the 
Ulpl -Smta complex is quite different from the HAUSP- 
Ubal complex with 1 .7 A rmsd for 62 aligned Ca atoms 
(data not shown), the active site conformation is highly 
conserved (Figure 5C). Similar to Yuhl , the catalytic triad 
of Ulpl can be superimposed with that of HAUSP with 
0.21 A rmsd. In addition, the stabilization of the oxyanion 
involves a similar residue (Gln574) in a similar location 
in Ulpl (Figure 5C). These analyses indicate that the 
UBPs, the UCHs, and the ULPs all employ a highly con- 
served catalytic mechanism for the deconjugation of 
ubiquitin and SUMO from target proteins. 

HAUSP Recognizes the C Terminus of p53 

p53 contains an N-terminal transactivatton domain (1 5- 
30). a central DMA binding core domain (94-292), a tet- 
ramerization domain (326-355). and a C-terminal regu- 
latory domain (356-393) (Levine, 1997). It has been 
reported that the N-terminal domain (residues 1-248) of 
HAUSP was sufficient for binding to p53 (Li et al., 2002). 
The N-terminal 52 residues of HAUSP are 75% hydro- 
phillc and are likely to t>e flexible in solution as they 
are readily removed from the rest of the sequences by 
limited proteolysis. To further define the domain bound- 
ary, we generated a series of deletion variants of HAUSP, 
expressed and purified these recombinant proteins, and 
examined their ability to Interact with p53. Using gel 
filtration, we found that a protease-resistant fragment 



(53-208) of HAUSP was both necessary and sufficient 
for stable interactions with p53 whereas neither the core 
domain (208-^60) nor the C-terminal domain (560-1 102) 
formed a stable complex with p53 (Figure 6A). 

To identify the minimal sequence requirement in p53, 
we generated a number of deletion variants and assayed 
their interaction with HAUSP (53-208) using gel filtration 
(Figure 6B). Neither the DNA binding core domain nor 
the oligomerization domain of p53 was required for the 
formation of a stable complex with HAUSP. Rather, a 
C-terminal 32-residue peptide of p53 (351-382) con- 
tained the determinants for the interaction with HAUSP 
(Figures 6B and 6C). In particular, a short 11 -residue 
peptide stretch (357-367) of p53 plays a critical rote 
in binding to HAUSP as removal of this sequence to 
generate p53 (325-356) eliminated the interaction with 
HAUSP (Figure 6B). These analyses demonstrate that 
the N-terminal domain (53-208) of HAUSP stably inter- 
acts with a minima] C-terminal peptide (357-382) of p53 
(Figure 6B). Supporting this conclusion, the N-terminal 
domain of HAUSP (residues 58-1 96) was found to share 
significant homology (up to 32% sequence identity) to 
the TRAF (TNF receptor associated factor) domain (Za- 
pata et al., 2001), a known peptide binding motif (Chung 
et al.. 2002). 

Discussion 

Although protein ubiquitination and subsequent protea- 
somal degradation have been studied extensively, pro- 
tein deubiquitination and hence stabilization remain less 
well characterized (Chung and Baek. 1999; D*Andrea 
and Pellman. 1998; Wilkinson, 1 997). Many and possibly 
most of the UBPs function to deubiquitinate conjugates 
of specific proteins. HAUSP is a UBP originally identified 
as a cellular protein that interacts with the immediate- 
eariy protein Vmwl 1 0 of the Herpes simplex virus (Ever- 
ett et al., 1997). Recently, HAUSP was found to modulate 
the stability and function of the tumor suppressor pro- 
tein p53 via specific deubiquitination (Li et al.. 2002). In 
this study, we report the structure of a UBP catalytic 
core domain, which reveals a novel three-domain archi- 
tecture comprised of the Fingers, Palm, and Thumb do- 
mains. It is of particular note that this nomenclature 
should not t>e confused with that of the DNA polymerase, 
where Fingers. Palm, and Thumb refer to different enti- 
ties (Steitz, 1 999 and references therein). We also report 
the structure of the catalytic core domain of HAUSP 
bound to ubiquitin aldehyde, which reveals a dramatic 
conformational change in the active site upon binding 
by Ubal. The conformational state of HAUSP bound to 
Ubal is likely to resemble that of HAUSP bound to sub- 
strate. 

In a search for structural homologs using the program 
DALI (Holm and Sander, 1993), the Ca^^-bound form of 
the p. calpain protease core (PDB code 1 KXR) was found 
to exhibit the highest degree of similarity to HAUSP. with 
a rmsd of 2.1 A for 87 aligned Ca atoms (9% sequence 
identity). Interestingly. Ca'"^ binding to |x calpain also 
induces a large conformational switch in the active site, 
realigning a separated catalytic triad into a productive 
conformation (Moldoveanu et al., 2002). The extent of 
this change upon Ca''^ binding is similar to that otiserved 
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Figure 6. The N-Terminal Domain of HAUSP Recognizes the C-Temninat Peptide Sequences of p53 

(A) tdenttfi cation of the N-termtnal domain of HAUSP as the primary p53 binding motif. Various purified recombinant HAUSP fragments were 
individually incubated with a large p53 fragment (residues 94-393) and then subjected to a gel filtration analysis. The results are summarized 
here. 

(B) Mapping of a minimal p53 fragment that is both necessary and sufficient for the formation of a stable complex with HAUSP. 

(C) A representative gel filtration run for the complex between HAUSP (53-208) and pS3 (351-382). Allquots of the fractions were visualized 
by Coomassie staining following SDS-PAGE. The p53 (351-^82) peptide, which in isolation was eluted from gel filtration in fractions 38^^, 
was eluted in fraction 31-33 when complexed with HAUSP (53-208). 



In the HAUSP catalytic core domain upon Ubai binding. 
In contrast to the Ubal binding-regulated enzynr^e activity 
for HAUSP, the calpain protease Is regulated by the 
local Ca^^ concentration and only acts upon surrounding 
proteins in response to elevated Ca'^. 

Polyubiquitin conjugated to target proteins can be 
assembled through different lysine residues of ubiquitin, 
producing different outcomes (Pickart, 2000). Polyubi- 
quitin linlced by Lys48 or, more rarely, Lys29 is thought to 
direct proteins to the 26S proteasome for degradation, 
whereas Lys63-ijnl(ed polyubiquitin appears to signal 
other responses such as ONA repair arid endocytosis. 
Both Lys29 and Lys63 are largely solvent accessible in 
the ubIquitJn moiety of the HAUSP-Uk>al complex, but 
the Lys48 side chain is involved in hydrogen bonds to 
residues in helix a5 of HAUSP and is only partially ex- 
posed to solvent Examination of the local structure indi- 
cates that the side chain of Lys48 can be linked to 
the C-terminal cartx>xy]ate of another ubiquitin without 
disruption of the HAUSP-Ubal interface. Nevertheless, 
Lys48 is not as freely available as the other ubiquitin 
lysine residues. Moreover, the HAUSP residues Asp305 
and Glu308 that coordinate ubiquitin Lys48 are con- 
served as acidic amino acids in some other UBPs (e.g., 
SCUBP15 and dFAF in Figure 2). A possible function of 
these residues is to promote binding of a ubiquitin moi- 
ety that contains a free Lys48 side chain. Thus, cleavage 



by HAUSP might be biased toward the ubiquitin at the 
distal end of a K48-linked chain or toward conjugates 
with either monoubiquitin or non-K48-linked polyubi- 
quitin. 

At present, which ubiquitin lysine(s) is used in the 
MDM2-mediated potyubiquitination of p53 has not been 
established. It also is unclear whether the p53 conju- 
gates acted on by HAUSP contain one or more polyubi- 
quitin chains or instead are monoubiquitinated at multi- 
ple sites. Severed lysines clustered near the C terminus 
function as ubiquitination sites on p53 (Naksmiura et al., 
2000; Rodriguez et al., 2000), and simultaneous mutation 
of these lysine residues Oyslne 370, 372, 373, 381 , 382, 
and 386) interfered with MDM2-dependent ubiquitina- 
tion and degradation of p53. Surprisingly, ubiquitination 
of p53 by MDM2 in vitro was reported to yield conjugates 
that contained multiple mono-ubiquitin moieties (Lai et 
al., 2001). Establishing the specificity of HAUSP and 
whether it can remove entire polyubiquitin chains or 
only monoubiquitins from p53 should give considerable 
insight into the nature of ubiquitin-p53 conjugates in 
vivo. The presence of the p53 binding domain at the 
N-termtnal portion of HAUSP argues strongly that it may 
prefer to cleave a conjugate at the proximal ubiquitin, 
i.e., between p53 and the attached mono- or polyubiqui- 
tin chain. In our in vitro assays, full-length HAUSP and 
the catalytic core domain were equally able to cleave 
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diubiqultin into mono-ubiquitin (Figure 4E and data not 
shown), and the full-length HAUSP removed all of the 
ubiquitins from the p53 conjugates (Li et al., 2002 and 
data not shown). However, this point, as well as an 
evaluation of HAUSP deublquitination activity on mono- 
ubiquitin versus potyublquitin p5d conjugates, awaits 
quantitative kinetic experiments with defined sut>- 
strates. 

In this study, we identified the C-terminal peptide se- 
quences of p53 as necessary and sufficient for binding 
to the p53-recognltion domain of HAUSP. The minimal 
peptide (residues 357-382) comprises only 26 amino 
acids (Figure 6B) but contains five of the six putative 
ubiqultination sites. The extensive overiap of these mul- 
tiple ubtquttination sites with the HAUSP binding site 
imposes important structural constraints on HAUSP. 
Rrst, the HAUSP pSS-recognltion domain must be In 
close proximity to the active site cleft of the catalytic 
domain. Second, to accommodate these multiple ubi- 
quitin attachment sites, the connection between the 
HAUSP catalytic core domain and the p53 binding do- 
main must be flexible in order to position each ubiquitin- 
p53 linkage within the active site for nucleophilic attack 
by Cys223. This organization Is likely to be critical for 
the efficient deubiquitination of p53 In vivo. 

Experimental Procedures 
Prot^ Preparation 

All constructs were generated using a standard PCR-based cloning 
strategy. The catalytic core domain of HAUSP (208-560) and all 
mutants were cloned into the vector pGEX-2T (Pharmacia). The 
full-length HAUSP protein was cloned into the BacutoGotd virus 
(PharMingen) and expressed In insect cells. All HAUSP proteins 
were purified using glutathione sepharose 4B resin as described 
(Chai et al., 2001). The QST moiety was proteolytically removed by 
thrombin for all recombinant proteins in this study. Seleno-Met- 
substituted HAUSP (208-^60) was generated as described (Qin et 
aL, 1999). 

Generation of Ubiquitin Aldehyde (Ubal) 
and a HAUSP-Ubal Complex 

Ubal was prepared by carboxypeptidase Y-catatyzad exchange of 
3-anilno>1, 2-propanediol for ubiquitin Gly76 and the sutssequent 
oxidation of the ubiquitin-diol product with Nal04 (Ounten and Co- 
hen, 1989; Lam etat., 1997b). The Ubal thus obtained was incubated 
in 4>fold excess over HAUSP protein at pH 6 (25 mM Tris, 100 mM 
NaCI, and 5 mM DTT), and the HAUSP-Ut>al complex was isolated 
by gel filtration (Superdex 200, 10 mM Tris, [pH 8.0], 100 mM NaCt. 
and 4 mM DTT). 

Crystallization and Data Collection for HAUSP 
Crystals were grown by the hanging-drop method by mixing the 
HAUSP protein (residues 208-560) (^^15 mg/mO with an equal vol- 
ume of reservoir solution containing 100 mM Tris, [pH 7.0], and 20% 
PEG1000 (w/v). Small crystals appeared overnight and were used 
as seeds to generate larger crystals from Seleno-Met HAUSP pro- 
tein. The crystals belong to the spacegroup P2i, with a - 75.75 A, 
b » 66.74 A, c >s 76.34 A, and 3 » 95.4*. There are two molecules 
per asymmetric unit Crystals were equilibrated in a cryoprotectant 
buffer containing reservoir buffer plus 20% glycerol (vA^ and were 
flash frozen in a cold nitrogen stream at —1 70*'C. The native and 
MAD date set was collected at NSLS beamllnes X-25 and X-12C, 
respectively, and processed using the software Deruo and Scale- 
pack (OtwinowsM and Minor. 1997). 

Stnicture Determination 

The structure was determined by multiple anomalous dispereton 
(Table 1). Data collected at three wavelengths were treated with 



SOLVE (Tenvilliger and Berendzen, 1996) and 18 selenium atoms 
per asymmetric unit were located. These selenium positions were 
further refined using MLPHARE (CCP4, 1994). Initial MAD phases, 
with a mean figure of merit of 0.63 at 2.6 A resolution, were extended 
to 2.3 A and improved with solvent flattening and histogram match- 
ing using DM (CCP4, 1994). A model was built using O (Jones et 
aL, 1991) and refined at 2.3 A resolution using CNS (Brunger et al., 
1 998). The final refined model contains two molecules. One molecule 
contains residues 208-41 0, 418-601 , and 509-554. The other mole- 
cute contains residues 208-410, 418-601, and 509-555. 



Crystallization and Structure Determination 
of the HAI^P-Ubal Complex 

Crystals were grown by the hanging-drop method by mixing the 
complex ('>'10 mg/mQ with an equal volume of reservoir solution 
containing 1 00 mM citrate, [pH 5.5), and 20% PEG3000. The crystals 
belong to the spacegroup P2,2,2|, with a - 99.5 A. b 101.2 A, 
and c » 141.3 A. The native date set was collected at NSLS X-25 
and processed using the software Denzo and Scalepack (Otwinow- 
ski and Minor, 1997). 

The structure was determined t>y molecular replacement, using 
the software AMoRe (Navaza, 1994). The coordinates of HAliSP 
were used for rotatioruil research against all reflections between 15 
and 3j0 A in the native data set. The top 50 solutions from the 
rotational search were individually used for a subsequent transla- 
tional search, which yielded two excellent solutions. This model was 
examined with the program O (Jones et ai., 1991). Refinement by 
the program CNS (Brunger et al.. 1 998), against the 2.3 A native 
data set, allowed visualization of an CKJditlonal HAUSP fragment. A 
model was built with the program O and refined further by simulated 
annealing. The final atomic model contains two copies of the 
HAUSP-Ubal complex and an isolated HAUSP protein. Both com- 
plexes contain HAUSP residues 208-554 and Ubal residues 1-76. 
The isolated HAUSP protein contains residues 208-^10 and 
418-654. 



In Vitro Deubiquitination Assays 

The following ubiquitin conjugates were used as substrates: ubiqui- 
tin-7-amido-4-methylcoumarin (ubiquitin- AM C; from Boston Bio- 
chem); ubiquitin fused to the N terminus of unfolded and lucifer 
yellow (LY)-labeled chicken ovomucoid domain I (ut>iquitin-OM: Yao 
and Cohen, 2002); K48-linked diubiquitin (Chen and Pickart. 1990), 
and ubiqultinated p53 (U et aL, 2002). Quantitative assays were 
done by Incubation of substrate with HAUSP at 37X in 50 mM [pH 
8.0] HEPES, 50 mM NaCI, 5 mM DTT, 1 mM EDTA, and 0.1 mg/ml 
ovalbumin. Ubiquitin-AMC cleavage was monitored fluorometrically 
(Dang et al., 1998), and diubiquitin cleavage was determined by 
cation-exchange HPLC (Lam et al., 1997e^. The disassembly of fluo- 
rescent LY-labeled ubiquitin-OM was visualized and quantified with 
a cooled CCD camera system (BioChemi System, UVP Biolmaging) 
after separation by SOS-PAGE. To survey the effects of various 
mutations in HAUSP, an equal amount (0.5 >Lg) of the wild-type or 
mutant HAUSP proteins (208-^0) was incubated with 2 of the 
diubiquitin substrate at 37°C In 30 ^.1 reaction buffer containing 25 
mM Tris, [pH 8.0], 100 mM NaCI, 100 |ig/ml BSA. and 2 mM DTT. 
The reactton was stopped by the addition of 25 itl 2x SDS sample 
buffer and analyzed by SDS PAGE. 



An Interaction Assay by Gel RItration 

Gel filtratton was employed to examine the Interaction between p53 
and HAUSP. The details were as described (Wu et al., 2001). 
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